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3 <110> APPLICANT: Transkaryot ic Therapies, Inc. 

4 von Figura, Kurt 

5 Schmidt, Bernhard 

6 Dierks, Thomas 

7 Heartlein, Michael W. 

8 Cosma, Maria P. 

9 Ballabio, Andrea 

11 <120> TITLE OF INVENTION: DIAGNOSIS AND TREATMENT OF MULTIPLE SULFATASE DEFICIENCY AND 

12 OTHER SULFATASE DEFICIENCIES 

14 <130> FILE REFERENCE: 10278-048001 ^ 
C--> 16 <140> CURRENT APPLICATION NUMBER: US/10/775 , 678A ^ ) 

17 <141> CURRENT FILING DATE: 2004-02-10 r f\&\^ / 

19 <150> PRIOR APPLICATION NUMBER: US 60/447,747 (jv (\ 

20 <151> PRIOR FILING DATE: 2003-02-11 * ^ 
22 <160> NUMBER OF SEQ ID NOS : 96 

24 <170> SOFTWARE: Patentln version 3.2 

26 <210> SEQ ID NO: 1 

27 <211> LENGTH: 1180 

28 <212> TYPE: DNA 

29 <213> ORGANISM: Homo sapiens 

32 <220> FEATURE: 

33 <221> NAME/KEY: CDS 

34 <222> LOCATION: (20) .. (1141) 

36 <400> SEQUENCE: 1 

37 acatggcccg cgggacaac atg get gcg ccc gca eta ggg ctg gtg tgt gga 52 
3 8 Met Ala Ala Pro Ala Leu Gly Leu Val Cys Gly 

39 15 10 

41 cgt tgc cct gag ctg ggt etc gtc etc ttg ctg ctg ctg etc teg ctg 100 

42 Arg Cys Pro Glu Leu Gly Leu Val Leu Leu Leu Leu Leu Leu Ser Leu 

43 15 20 25 

45 ctg tgt gga gcg gca ggg age cag gag gee ggg ace ggt gcg ggc gcg 148 

46 Leu Cys Gly Ala Ala Gly Ser Gin Glu Ala Gly Thr Gly Ala Gly Ala 

47 30 35 40 

49 999 tcc ctt 9 C 9 99t tct tgc ggc tgc ggc acg ccc cag egg cct ggc 196 

50 Gly Ser Leu Ala Gly Ser Cys Gly Cys Gly Thr Pro Gin Arg Pro Gly 

51 45 50 55 

53 gee cat ggc agt teg gca gee get cac cga tac teg egg gag get aac 244 

54 Ala His Gly Ser Ser Ala Ala Ala His Arg Tyr Ser Arg Glu Ala Asn 

55 60 65 70 75 

57 get ccg ggc ccc gta ccc gga gag egg caa etc gcg cac tea aag atg 292 

58 Ala Pro Gly Pro Val Pro Gly Glu Arg Gin Leu Ala His Ser Lys Met 

59 80 85 90 

61 gtc ccc ate cct get gga gta ttt aca atg ggc aca gat gat cct cag 340 
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62 Val Pro lie Pro Ala Gly Val Phe Thr Met Gly Thr Asp Asp Pro Gin 

63 95 100 105 

65 ata aag cag gat ggg gaa gca cct gcg agg aga gtt act att gat gcc 3 88 

66 lie Lys Gin Asp Gly Glu Ala Pro Ala Arg Arg Val Thr lie Asp Ala 

67 110 115 120 

69 ttt tac atg gat gcc tat gaa gtc agt aat act gaa ttt gag* aag ttt 436 

70 Phe Tyr Met Asp Ala Tyr Glu Val Ser Asn Thr Glu Phe Glu Lys Phe 

71 125 130 135 

73 gtg aac tea act ggc tat ttg aca gag get gag aag ttt ggc gac tec 484 

74 Val Asn Ser Thr Gly Tyr Leu Thr Glu Ala Glu Lys Phe Gly Asp Ser 

75 140 145 150 155 

77 ttt gtc ttt gaa ggc atg ttg agt gag caa gtg aag acc aat att caa 532 

78 Phe Val Phe Glu 'Gly Met Leu Ser Glu Gin Val Lys Thr Asn He Gin 

79 160 165 170 

81 cag gca gtt gca get get ccc tgg tgg tta cct gtg aaa ggc get aac 580 

82 Gin Ala Val Ala Ala Ala Pro Trp Trp Leu Pro Val Lys Gly Ala Asn 

83 175 ' 180 185 

85 tgg aga cac cca gaa ggg cct gac tct act att ctg cac agg ccg gat 62 8 

86 Trp Arg His Pro Glu Gly Pro Asp Ser Thr He Leu His Arg Pro Asp 

87 190 195 200 

89 cat cca gtt etc cat gtg tec tgg aat gat gcg gtt gcc tac tgc act 676 

90 His Pro Val Leu His Val Ser Trp Asn Asp Ala Val Ala Tyr Cys Thr 

91 205 210 215 

93 tgg gca ggg aag egg ctg ccc acg gaa get gag tgg gaa tac age tgt 724 

94 Trp Ala Gly Lys Arg Leu Pro Thr Glu Ala Glu Trp Glu Tyr Ser Cys 

95 220 225 230 235 

97 cga gga ggc ctg cat aat aga ctt ttc ccc tgg ggc aac aaa ctg cag 772 

98 Arg Gly Gly Leu His Asn Arg Leu Phe Pro Trp Gly Asn Lys Leu Gin 

99 240 245 250 

101 ccc aaa ggc cag cat tat gcc aac att tgg cag ggc gag ttt ccg gtg 820 

102 Pro Lys Gly Gin His Tyr Ala Asn He Trp Gin Gly Glu Phe Pro Val 

103 255 260 265 

105 acc aac act ggt gag gat ggc ttc caa gga act gcg cct gtt gat gcc 868 

106 Thr Asn Thr Gly Glu Asp Gly Phe Gin Gly Thr Ala Pro Val Asp Ala 

107 270 275 280 

109 ttc cct ccc aat ggt tat ggc tta tac aac ata gtg ggg aac gca tgg 916 

110 Phe Pro Pro Asn Gly Tyr Gly Leu Tyr Asn He Val Gly Asn Ala Trp 

111 285 290 295 

113 gaa tgg act tea gac tgg tgg act gtt cat cat tct gtt gaa gaa acg 964 

114 Glu Trp Thr Ser Asp Trp Trp Thr Val His His Ser Val Glu Glu Thr 

115 300 305 310 315 

117 ctt aac cca aaa ggt ccc cct tct ggg aaa gac cga gtg aag aaa ggt 1012 

118 Leu Asn Pro Lys Gly Pro Pro Ser Gly Lys Asp Arg Val Lys Lys Gly 

119 320 325 330 

121 gga tec tac atg tgc cat agg tct tat tgt tac agg tat cgc tgt get 1060 

122 Gly Ser Tyr Met Cys His Arg Ser Tyr Cys Tyr Arg Tyr Arg Cys Ala 

123 335 340 345 

125 get egg age cag aac aca cct gat age tct get teg aat ctg gga ttc 1108 

126 Ala Arg Ser Gin Asn Thr Pro Asp Ser Ser Ala Ser Asn Leu Gly Phe 
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<210> SEQ ID NO 


: 2 
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<211> LENGTH: 374 
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<212> TYPE: 
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*Homo sapiens 
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220 
223 
224 
227 
228 
231 
232 
235 
236 



305 

Pro Pro Ser Gly Lys 
325 

His Arg Ser Tyr Cys 
340 

Thr Pro Asp Ser Ser 
355 

Arg Leu Pro Thr Met 
370 



330 

Tyr Arg Tyr Arg Cys Ala 
345 

Ala Ser Asn Leu Gly Phe 
360 

Asp 



310 315 
Asp Arg Val Lys Lys Gly 



320 

Gly Ser Tyr Met Cys 
335 

Ala Arg Ser Gin Asn 
350 

Arg Cys Ala Ala Asp 
365 



239 <210> SEQ ID NO: 3 

240 <211> LENGTH: 1122 

241 <212> TYPE: DNA 

242 <213> ORGANISM: Homo sapiens 

244 <400> SEQUENCE: 3 

245 atggctgcgc ccgcactagg gctggtgtgt ggacgttgcc ctgagctggg tctcgtcctc 60 
247 ttgctgctgc tgctctcgct gctgtgtgga gcggcaggga gccaggaggc cgggaccggt 12 0 
249 gcgggcgcgg ggtcccttgc gggttcttgc ggctgcggca cgccccagcg gcctggcgcc 180 
251 catggcagtt cggcagccgc tcaccgatac tcgcgggagg ctaacgctcc gggccccgta 240 
253 cccggagagc ggcaactcgc gcactcaaag atggtcccca tccctgctgg agtatttaca 300 
255 atgggcacag atgatcctca gataaagcag gatggggaag cacctgcgag gagagttact 360 
257 attgatgcct tttacatgga tgcctatgaa gtcagtaata ctgaatttga gaagtttgtg 420 
259 aactcaactg gctatttgac agaggctgag aagtttggcg actcctttgt ctttgaaggc 480 
261 atgttgagtg agcaagtgaa gaccaatatt caacaggcag ttgcagctgc tccctggtgg 540 
263 ttacctgtga aaggcgctaa ctggagacac ccagaagggc ctgactctac tattctgcac 600 
265 aggccggatc atccagttct ccatgtgtcc tggaatgatg cggttgccta ctgcacttgg 660 
267 gcagggaagc ggctgcccac ggaagctgag tgggaataca gctgtcgagg aggcctgcat 720 
269 aatagacttt tcccctgggg caacaaactg cagcccaaag gccagcatta tgccaacatt 780 
271 tggcagggcg agtttccggt gaccaacact ggtgaggatg gcttccaagg aactgcgcct 840 
273 gttgatgcct tccctcccaa tggttatggc ttatacaaca tagtggggaa cgcatgggaa 900 
275 tggacttcag actggtggac tgttcatcat tctgttgaag aaacgcttaa cccaaaaggt 960 
277 cccccttctg ggaaagaccg agtgaagaaa ggtggatcct acatgtgcca taggtcttat 1020 
279 tgttacaggt atcgctgtgc tgctcggagc cagaacacac ctgatagctc tgcttcgaat 1080 
281 ctgggattcc gctgtgcagc cgaccgcctg cccaccatgg ac 1122 

284 <210> SEQ ID NO: 4 

285 <211> LENGTH: 2130 

286 <212> TYPE: DNA 

287 <213> ORGANISM: Homo sapiens 

289 <400> SEQUENCE: 4 

290 acatggcccg cgggacaaca tggctgcgcc cgcactaggg ctggtgtgtg gacgttgccc 60 
292 tgagctgggt ctcgtcctct tgctgctgct gctctcgctg ctgtgtggag cggcagggag 120 
294 ccaggaggcc gggaccggtg cgggcgcggg gtcccttgcg ggttcttgcg gctgcggcac 180 
296 gccccagcgg cctggcgccc atggcagttc ggcagccgct caccgatact cgcgggaggc 240 
298 taacgctccg ggccccgtac ccggagagcg gcaactcgcg cactcaaaga tggtccccat 300 
300 ccctgctgga gtatttacaa tgggcacaga tgatcctcag ataaagcagg atggggaagc 3 60 
302 acctgcgagg agagttacta ttgatgccct ttacatggat gcctatgaag tcagtaatac 420 
304 tgaatttgag aagtttgtga actcaactgg ctatttgaca gaggctgaga agtttggcga 480 
306 ctcctttgtc tttgaaggca tgttgagtga gcaagtgaag accaatattc aacaggcagt 540 
308 tgcagctgct ccctggtggt tacctgtgaa aggcgctaac tggagacacc cagaagggcc 600 
310 tgactctact attctgcaca ggccggatca tccagttctc catgtgtcct ggaatgatgc 660 
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312 ggttgcctac tgcacttggg cagggaagcg gctgcccacg gaagctgagt gggaatacag 72 0 
314 ctgtcgagga ggcctgcata atagactttt cccctggggc aacaaactgc agcccaaagg 780 
316 ccagcattat gccaacattt ggcagggcga ttttccggtg accaacactg gtgaggatgg 840 
318 cttccaagga actgcgcctg ttgatgcctt ccctcccaat ggttatggct tatacaacat 900 
32 0 agtggggaac gcatgggaat ggacttcaga ctggtggact gt teat cat t ctgttgaaga 960 
322 aacgettaac ccaaaaggtc ccccttctgg gaaagaccga gtgaagaaag gtggatccta 1020 
324 catgtgccat aggtcttatt gttacaggta tcgctgtgct gctcggagcc agaacacacc 1080 
326 tgatagctct gcttcgaatc tgggattccg ctgtgcagcc gaccgcctgc ccaccatgga 1140 
328 ctgacaacca agggtagtct tccccagtcc aaggagcagt cgtgtctgac ctacattggg 1200 
330 ctttcctcag aactttgaac gatcccatgc aaagaattcc caccctgagg tgggttacat 1260 
332 acctgcccaa tggecaaagg aaccgccttg tgagaccaaa ttgetgaect gggtcagtgc 1320 
334 atgtgcttta tggtgtggtg catctttgga gatcatcacc atattttact tttgagagtc 1380 
336 tttaaagagg aaggggagtg gagggaaccc tgagctaggc ttcaggaggc ccgcatccta 1440 
338 cgcaggctct gecacagggg ttagacccca ggtccgacgc ttgaccttcc tgggcctcaa 1500 
340 gtgccctccc ctatcaaatg aaggaatgga cagcatgacc tctgggtgtc tctccaactc 1560 
342 accagttcta aaaagggtat cagattctat tgtgacttca tagaatttat gatagattat 1620 
344 tttttagcta ttttttccat gtgtgaacct tgagtgatac taatcatgta aagtaagagt 1680 
346 tctcttatgt attatgttcg gaagaggggt gtggtgactc ctttatattc gtactgeact 1740 
348 ttgtttttcc aaggaaatca gtgtctttta cgttgttatg atgaatccca catggggccg 1800 
350 gtgatggtat gctgaagttc agccgttgaa cacataggaa tgtctgtggg gtgactctac 1860 
352 tgtgctttat cttttaacat taagtgcctt tggttcagag gggcagtcat aagctctgtt 1920 
354 tccccctctc cccaaagcct teagegaacg tgaaatgtgc getaaaeggg gaaacctgtt 1980 
356 taattctaga tatagggaaa aaggaacgag gaccttgaat gagctatatt cagggtatcc 2040 
358 ggtattttgt aatagggaat aggaaacctt gttggctgtg gaatatccga tgctttgaat 2100 
360 catgeactgt gttgaataaa egtatctget 2130 

363 <210> SEQ ID NO: 5 

364 <211> LENGTH: 374 

365 <212> TYPE: PRT 

366 <213> ORGANISM: Homo sapiens 
368 <400> SEQUENCE: 5 

370 Met Ala Ala Pro Ala Leu Gly Leu Val Cys Gly Arg Cys Pro Glu Leu 

371 15 10 15 

3 74 Gly Leu Val Leu Leu Leu Leu Leu Leu Ser Leu Leu Cys Gly Ala Ala 
375 20 25 30 

3 78 Gly Ser Gin Glu Ala Gly Thr Gly Ala Gly Ala Gly Ser Leu Ala Gly 
379 35 40 45 

382 Ser Cys Gly Cys Gly Thr Pro Gin Arg Pro Gly Ala His Gly Ser Ser 

383 50 55 60 

386 Ala Ala Ala His Arg Tyr Ser Arg Glu Ala Asn Ala Pro Gly Pro Val 

387 65 70 75 80 

390 Pro Gly Glu Arg Gin Leu Ala His Ser Lys Met Val Pro lie Pro Ala 

391 85 90 95 

3 94 Gly Val Phe Thr Met Gly Thr Asp Asp Pro Gin lie Lys Gin Asp Gly 
395 100 105 110 

398 Glu Ala Pro Ala Arg Arg Val Thr lie Asp Ala Leu Tyr Met Asp Ala 

399 115 120 125 

402 Tyr Glu Val Ser Asn Thr Glu Phe Glu Lys Phe Val Asn Ser Thr Gly 

403 130 135 140 

406 Tyr Leu Thr Glu Ala Glu Lys Phe Gly Asp Ser Phe Val Phe Glu Gly 
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PATENT APPLICATION: US/10/775 , 678A 
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Please Note: 

Use of n and/or Xaa have been detected in the Sequence Listing, Please review the 
Sequence Listing to ensure that a corresponding explanation is presented in the <220> 
to <223> fields of each sequence which presents at least one n or Xaa. 



Seq#:32; Xaa Pos . 1,2,3 

Seq#:79; Xaa Pos . 3,4,6 

Seq#:81; N Pos. 590,626 

Seq#:82; N Pos . 690,755 

Seq#:83; N Pos. 6,47,81 
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VERIFICATION SUMMARY 

PATENT APPLICATION: US/10/775 , 678A 



DATE: 03/14/2007 
TIME: 10:46:32 



Input Set : A:\10278-048001.txt 

Output Set: N:\CRF4\03142007\J775678A.raw 



L:16 M:270 C: Current Application Number differs, Replaced Current Application Number 

L:3704 M:341 W: (46) "n" or "Xaa" used, for SEQ ID#:32 after pos . : 0 

L:6087 M:341 W: (46) "n" or "Xaa" used, for SEQ ID#:79 after pos . : 0 

L:6155 M:341 W: (46) "n" or "Xaa" used, for SEQ ID#:81 after pos.:540 
M:341 Repeated in SeqNo=81 

L:6199 M:341 W: (46) "n" or "Xaa" used, for SEQ ID#:82 after pos.:660 
M:341 Repeated in SeqNo=82 

L:6226 M:341 W: (46) "n" or "Xaa" used, for SEQ ID#:83 after pos . : 0 
M:341 Repeated in SeqNo=83 
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